Using our originally-pulled corpus of around 7,000 cases, I used Jimmy’s dictionary of terms to analyze it both as a complete set and on an individual-term basis.
Here is the raw dictionary of terms we looked for; for reference, terms were stemmed and lemmatized so that variants of the same term or phrase could be considered as one. Additionally, I removed stop words from all terms (common, non-substantive words) for ease of analysis with the full corpus.
| Original versus Stemmed Dictionary Terms | |
|---|---|
| Original Term | Stemmed Term |
| educational opportunity | educ opportun |
| equal opportunity | equal opportun |
| objective | object |
| intangible | intang |
| tangible | tangibl |
| engage with | engag |
| exchange views | exchang view |
| learn his profession | learn profess |
| feeling of inferiority | feel inferior |
| psychological | psycholog |
| substantial equality | substanti equal |
| prestige | prestig |
| alumni | alumni |
| substantial equality | substanti equal |
| reputation | reput |
| standing in the community | stand commun |
| traditions | tradit |
| intellectual commingling | intellectu commingl |
| aspirations | aspir |
| control their own destinies | control destini |
| self image | self imag |
| lower expectations | lower expect |
| stigma | stigma |
| sociological | sociolog |
| less measurable | less measur |
| oppportunity to compete | oppportun compet |
| antisocial attitudes and behavior | antisoci attitud behavior |
| made equal | made equal |
| melting pot | melt pot |
| diversity | divers |
| multiracial society | multiraci societi |
| social skills | social skill |
| non quantitative factor | non quantit factor |
| integrated educational experience | integr educ experi |
| participate fully | particip fulli |
| bi racial community | bi racial commun |
| meaningful integration | meaning integr |
| artificial advantage | artifici advantag |
| self perception | self percept |
| attitudinal effects | attitudin effect |
| grapple | grappl |
| realistic attitudes | realist attitud |
| negativism | negativ |
| play together and interact | plai togeth interact |
| get to know one another | get know on anoth |
| respect the other s differences | respect other' differ |
| tolerate each other | toler |
| more complex | complex |
| access | access |
| enclave | enclav |
| larger society | larger societi |
| determine success | determin success |
| majority culture | major cultur |
| two societies | two societi |
| mainstream of our society | mainstream societi |
| different world | differ world |
| self image | self imag |
| greater contacts | greater contact |
| greater understanding | greater understand |
| notwithstanding such equality | notwithstand equal |
| associate with children | associ children |
| co mingle | co mingl |
| extent of desegregation | extent desegreg |
| social equality | social equal |
| social matters | social matter |
| conduit | conduit |
| encounter of students | encount student |
| wide exposure | wide exposur |
| reputation of teachers | reput teacher |
| apartheid | apartheid |
| ability to learn | abil learn |
| precise measurement | precis measur |
| intangible vestiges | intang vestig |
| socioeconomic class consciousness | socioeconom class conscious |
| social scientific factors | social scientif factor |
| characteristics for future success | characterist futur success |
| preparing for meeting life | prepar meet life |
| no known yardstick | known yardstick |
| difficult to measure | difficult measur |
| beauty of the campus | beauti campu |
| develop relationships | develop relationship |
| dominant class | domin class |
| opportunity networks | opportun network |
| white social networks | white social network |
| old boy networks | old boi network |
| higher standing | higher stand |
| folkways | folkwai |
| perceptions of the white majority | percept white major |
| accredidation | accredid |
| alumni | alumni |
| standing of the institution | stand institut |
| status | statu |
| determine success | determin success |
| outside the classroom | outsid classroom |
| relations of men to one another | relat men on anoth |
| positions of influence and power | posit influenc power |
| form acquaintance | form acquaint |
| dominant figures | domin figur |
| to know and to be known | know known |
| confine his association | confin associ |
| being equalized | equal |
| student interaction | student interact |
| environment of a multi racial community | environ multi racial commun |
| cultural value | cultur valu |
| assimilation | assimil |
| influence | influenc |
| affluence | affluenc |
| jealously guard | jealous guard |
| traditionally closed | tradition close |
| alumni certificate | alumni certif |
| traditional society barriers | tradit societi barrier |
| social pattern | social pattern |
| personally acquainted | person acquaint |
| pale shadow | pale shadow |
| stature | statur |
Next, I wrote a customized script to track the occurrences of variable-length phrases in a corpus (amazingly, this did not appear to exist in any package or Stack Overflow post I could find). I ran it with our dictionary and the full relevant corpus (specified as 1950-1974 per Jimmy’s specifications).
First, I did just a raw count of the dictionary occurring as a set over time:
## Warning: Removed 1 row(s) containing missing values (geom_path).
However, this raw count doesn’t paint the full picture, so I also did a relative count of terms. This is simply the percentage of terms in a given year that were terms from our dictionary.
Next, I ran some quick rankings. First up is a ranking of terms by overall counts:
| Top Terms | |
|---|---|
| Term | Total Uses of Dictionary Terms |
| equal | 5003 |
| substanti equal | 2360 |
| object | 2171 |
| made equal | 1662 |
| equal opportun | 1466 |
| educ opportun | 1424 |
| engag | 1090 |
| determin success | 982 |
| statu | 932 |
| access | 637 |
| less measur | 575 |
| tradit | 559 |
| stand commun | 518 |
| stand institut | 493 |
| complex | 489 |
| social equal | 485 |
| social matter | 453 |
| associ children | 447 |
| extent desegreg | 435 |
| influenc | 426 |
| two societi | 411 |
| develop relationship | 356 |
| difficult measur | 355 |
| particip fulli | 345 |
| psycholog | 314 |
| know known | 308 |
| higher stand | 291 |
| greater understand | 290 |
| confin associ | 287 |
| precis measur | 287 |
| lower expect | 272 |
| social pattern | 244 |
| abil learn | 236 |
| divers | 227 |
| notwithstand equal | 224 |
| reput | 216 |
| toler | 205 |
| differ world | 198 |
| greater contact | 170 |
| feel inferior | 158 |
| major cultur | 150 |
| exchang view | 149 |
| meaning integr | 144 |
| cultur valu | 129 |
| outsid classroom | 126 |
| domin class | 125 |
| social skill | 116 |
| tradition close | 113 |
| larger societi | 111 |
| domin figur | 103 |
| learn profess | 88 |
| tangibl | 85 |
| alumni | 84 |
| realist attitud | 80 |
| encount student | 75 |
| self imag | 72 |
| person acquaint | 62 |
| form acquaint | 60 |
| reput teacher | 60 |
| integr educ experi | 59 |
| artifici advantag | 55 |
| posit influenc power | 55 |
| sociolog | 55 |
| prepar meet life | 54 |
| intang | 47 |
| prestig | 45 |
| self percept | 43 |
| aspir | 42 |
| wide exposur | 42 |
| student interact | 41 |
| stigma | 39 |
| characterist futur success | 33 |
| alumni certif | 26 |
| assimil | 21 |
| opportun network | 20 |
| conduit | 19 |
| tradit societi barrier | 19 |
| intellectu commingl | 18 |
| co mingl | 17 |
| intang vestig | 17 |
| bi racial commun | 16 |
| known yardstick | 16 |
| statur | 16 |
| enclav | 15 |
| mainstream societi | 15 |
| beauti campu | 13 |
| grappl | 13 |
| control destini | 12 |
| melt pot | 9 |
| affluenc | 8 |
| get know on anoth | 8 |
| pale shadow | 8 |
| social scientif factor | 8 |
| negativ | 7 |
| plai togeth interact | 6 |
| jealous guard | 5 |
| multiraci societi | 5 |
| percept white major | 5 |
| apartheid | 4 |
| non quantit factor | 2 |
| old boi network | 2 |
| relat men on anoth | 2 |
| socioeconom class conscious | 2 |
| white social network | 2 |
| accredid | 1 |
| antisoci attitud behavior | 1 |
| attitudin effect | 1 |
| folkwai | 1 |
| oppportun compet | 1 |
Then, the top 25 cases with the highest dictionary term counts:
| Top Cases - By Raw Counts | |
|---|---|
| Case | Total Uses in Corpus |
| hobson-v-hansen | 905 |
| united-states-v-morgan | 575 |
| san-antonio-independent-school-dist-v-rodriguez | 574 |
| oliver-v-kalamazoo-board-of-education | 546 |
| oregon-v-mitchell | 384 |
| hart-v-community-sch-bd-of-brooklyn-ny-sch-d-21 | 366 |
| keyes-v-school-district-number-one-denver-colorado | 329 |
| baker-v-carr | 315 |
| swann-v-charlotte-mecklenburg-board-of-education | 294 |
| united-states-v-ei-du-pont-de-nemours-co | 288 |
| united-states-v-state-of-texas | 275 |
| morales-v-turman | 256 |
| telex-corp-v-international-business-machines-corp | 242 |
| united-states-v-hk-porter-company | 223 |
| abington-school-dist-v-schempp | 221 |
| briggs-v-elliott | 220 |
| gertz-v-robert-welch-inc | 218 |
| higgins-v-board-of-education-grand-rapids-mich | 207 |
| keyes-v-school-district-no-1-denver-colorado | 195 |
| stell-v-savannah-chatham-county-board-of-education | 191 |
| pennsylvania-assn-retd-child-v-commonwealth-of-pa | 186 |
| graves-v-barnes | 181 |
| stamps-v-detroit-edison-co | 176 |
| beer-v-united-states | 175 |
| bradley-v-milliken | 175 |
Then, the top 25 most frequent cases in terms of dictionary richness (percentage of dictionary words in its overall word count):
| Top Cases - By Dictionary Richness | |
|---|---|
| Case | Percent of Dictionary Terms in Case |
| hobson-v-hansen | 19.925143 |
| briggs-v-elliott | 16.897081 |
| briggs-v-elliott | 14.369693 |
| bush-v-orleans-parish-school-board | 13.190731 |
| bush-v-orleans-parish-school-board | 12.542373 |
| swann-v-charlotte-mecklenburg-board-of-education | 10.414453 |
| mcswain-v-county-board-of-education | 9.921671 |
| swann-v-charlotte-mecklenburg-board-of-education | 9.767442 |
| hart-v-community-sch-bd-of-brooklyn-ny-sch-d-21 | 9.334353 |
| cisneros-v-corpus-christi-independent-school-dist | 8.982512 |
| lee-v-macon-county-board-of-education | 7.285651 |
| swann-v-charlotte-mecklenburg-board-of-education | 7.158510 |
| swann-v-charlotte-mecklenburg-board-of-education | 7.123819 |
| swann-v-charlotte-mecklenburg-board-of-education | 6.571301 |
| hobson-v-hansen | 6.464747 |
| keyes-v-school-district-number-one-denver-colorado | 6.303890 |
| keyes-v-school-district-number-one-denver-colorado | 5.895001 |
| norwalk-core-v-norwalk-board-of-education | 5.812221 |
| keyes-v-school-district-number-one-denver-colorado | 5.746725 |
| hobson-v-hansen | 4.824351 |
| oliver-v-kalamazoo-board-of-education | 4.465892 |
| moses-v-washington-parish-school-board | 4.419036 |
| evans-v-buchanan | 3.761419 |
| morales-v-turman | 3.670251 |
| united-states-v-state-of-texas | 3.661784 |
Next, the top 25 courts with the highest dictionary term counts:
| Top Courts Using Dictionary Terms - By Raw Counts | |
|---|---|
| Court | Total Uses of Dictionary Terms |
| scotus | 4099 |
| nysd | 2085 |
| dcd | 1756 |
| laed | 1119 |
| paed | 1077 |
| vaed | 1068 |
| ilnd | 1011 |
| miwd | 969 |
| txsd | 799 |
| almd | 788 |
| nyed | 786 |
| mied | 722 |
| mdd | 665 |
| txed | 577 |
| alnd | 571 |
| cod | 565 |
| ded | 555 |
| ncwd | 497 |
| ared | 461 |
| pawd | 443 |
| cand | 441 |
| txwd | 430 |
| mssd | 394 |
| gand | 379 |
| southcarolinaed | 359 |
And the top 25 courts in terms of dictionary richness (percentage of dictionary words across all of a court’s text):
| Top Courts - By Dictionary Richness | |
|---|---|
| Court | Percent of Dictionary Terms for all Cases |
| nmd | 1.5140045 |
| kyed | 0.8928571 |
| miwd | 0.8294316 |
| sdd | 0.8205128 |
| nyed | 0.8038125 |
| utd | 0.8011653 |
| nvd | 0.7970849 |
| caed | 0.7892204 |
| ctd | 0.7256058 |
| wied | 0.7232585 |
| southcarolinaed | 0.7017750 |
| gasd | 0.6936031 |
| nhd | 0.6800151 |
| ord | 0.6653897 |
| pamd | 0.6603588 |
| txed | 0.6437002 |
| ohnd | 0.6401915 |
| prd | 0.6311993 |
| cod | 0.6298142 |
| wiwd | 0.6246042 |
| tned | 0.6173435 |
| dcd | 0.5977893 |
| scotus | 0.5949936 |
| oked | 0.5920079 |
| cand | 0.5735541 |
And finally, here are graphs for each individual word - both raw and relative counts.
## Warning: Removed 4 row(s) containing missing values (geom_path).
## Warning: Removed 1 row(s) containing missing values (geom_path).
## Warning: Removed 8 row(s) containing missing values (geom_path).
## Warning: Removed 10 row(s) containing missing values (geom_path).
## Warning: Removed 1 row(s) containing missing values (geom_path).
## Warning: Removed 4 row(s) containing missing values (geom_path).
## Warning: Removed 24 row(s) containing missing values (geom_path).
## Warning: Removed 24 row(s) containing missing values (geom_path).
## Warning: Removed 3 row(s) containing missing values (geom_path).
## Warning: Removed 17 row(s) containing missing values (geom_path).
## Warning: Removed 3 row(s) containing missing values (geom_path).
## Warning: Removed 20 row(s) containing missing values (geom_path).
## Warning: Removed 8 row(s) containing missing values (geom_path).
## Warning: Removed 13 row(s) containing missing values (geom_path).
## Warning: Removed 8 row(s) containing missing values (geom_path).
## Warning: Removed 2 row(s) containing missing values (geom_path).
## Warning: Removed 24 row(s) containing missing values (geom_path).
## Warning: Removed 17 row(s) containing missing values (geom_path).
## Warning: Removed 1 row(s) containing missing values (geom_path).
## Warning: Removed 15 row(s) containing missing values (geom_path).
## Warning: Removed 11 row(s) containing missing values (geom_path).
## Warning: Removed 2 row(s) containing missing values (geom_path).
## Warning: Removed 25 row(s) containing missing values (geom_path).
## Warning: Removed 15 row(s) containing missing values (geom_path).
## Warning: Removed 14 row(s) containing missing values (geom_path).
## Warning: Removed 3 row(s) containing missing values (geom_path).
## Warning: Removed 5 row(s) containing missing values (geom_path).
## Warning: Removed 18 row(s) containing missing values (geom_path).
## Warning: Removed 1 row(s) containing missing values (geom_path).
## Warning: Removed 1 row(s) containing missing values (geom_path).
## Warning: Removed 24 row(s) containing missing values (geom_path).
## Warning: Removed 7 row(s) containing missing values (geom_path).
## Warning: Removed 3 row(s) containing missing values (geom_path).
## Warning: Removed 1 row(s) containing missing values (geom_path).
## Warning: Removed 5 row(s) containing missing values (geom_path).
## Warning: Removed 1 row(s) containing missing values (geom_path).
## Warning: Removed 3 row(s) containing missing values (geom_path).
## Warning: Removed 23 row(s) containing missing values (geom_path).
## Warning: Removed 20 row(s) containing missing values (geom_path).
## Warning: Removed 24 row(s) containing missing values (geom_path).
## Warning: Removed 14 row(s) containing missing values (geom_path).
## Warning: Removed 25 row(s) containing missing values (geom_path).
## Warning: Removed 18 row(s) containing missing values (geom_path).
## Warning: Removed 3 row(s) containing missing values (geom_path).
## Warning: Removed 25 row(s) containing missing values (geom_path).
## Warning: Removed 2 row(s) containing missing values (geom_path).
## Warning: Removed 18 row(s) containing missing values (geom_path).
## Warning: Removed 16 row(s) containing missing values (geom_path).
## Warning: Removed 5 row(s) containing missing values (geom_path).
## Warning: Removed 5 row(s) containing missing values (geom_path).
## Warning: Removed 1 row(s) containing missing values (geom_path).
## Warning: Removed 13 row(s) containing missing values (geom_path).
## Warning: Removed 11 row(s) containing missing values (geom_path).